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Abstract 

Nonsense-mediated mRNA decay (NMD) causes accelerated transcript degradation when a premature translation 
termination codon disrupts the open reading frame (ORF). Although endogenous transcripts that have uninterrupted ORFs 
are typically insensitive to NMD, some can nonetheless become prone to NMD when translation terminates at out-of-frame 
premature stop codons. This occurs when introns containing stop codons fail to be spliced, when translation of an upstream 
ORF (uORF) terminates in the 5'-untranslated region (5'-UTR) or the coding region, or when the 5'-proximal AUG initiation 
codon is bypassed and translation initiates at a downstream out-of-frame AUG followed by a stop codon. Some 3'- 
untranslated regions (3'-UTRs) are also known to trigger NMD, but the mechanism is less well understood. To further study 
the role of 3'-UTRs in NMD, a reporter system was designed to examine 3'-UTRs from candidate genes known to produce 
NMD-sensitive transcripts. Out of eight that were tested, the 3'-UTRs from MSH4 and SP016 caused NMD-dependent mRNA 
destabilization. Both endogenous genes produce multiple transcripts that differ in length at the 3' end. Detailed studies 
revealed that the longest of six reporter MSH4-3'-UTR transcripts was NMD-sensitive but five shorter transcripts were 
insensitive. NMD-dependent degradation of the long transcript required Xrnl, which degrades mRNA from the 5' end. 
Sensitivity to NMD was not associated with extensive translational read-through past the normal stop codon. To our 
knowledge, this is the first example where multiple transcripts containing the same ORF are differentially sensitive to NMD 
in Saccharomyces cerevisiae. The results provide a proof of principle that long 3'-UTRs can trigger NMD, which suggests a 
potential link between errors in transcription termination or processing and mRNA decay. 
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Introduction 

Nonsense mediated mRNA decay (NMD) is an mRNA quality 
control mechanism that was first discovered in Saccharomyces 
cerevisiae and was later found to be ubiquitous throughout 
eukaryotes [1,2,3]. NMD prevents the accumulation of truncated 
proteins produced from defective transcripts. Base substitutions 
cause premature termination of translation whenever a sense 
codon is changed to a stop codon. In AT-rich genomes, multiple 
stop codons reside in all of the alternate reading frames of virtually 
every gene. For this reason, most frameshift mutations bring a 
premature termination codon (PTC) into register. NMD monitors 
the translatable RNA population through a process called RNA 
surveillance, resulting in the elimination of RNAs that, because of 
a coding error, could produce potentially deleterious truncated 
proteins. Splicing errors can also result in the inclusion of a PTC in 
the coding region, which targets the mis-spliced transcript for 
decay by the NMD pathway. 

In S. cerevisiae, nonsense transcripts are targeted for NMD by a 
surveillance complex consisting of five proteins: Upfl, Upf2, Upf3, 
eRFl and eRF3 [4,5,6]. Upfl is part of a superfamily 1 ATP- 
dependent RNA helicase that localizes to the cytoplasm [7] . When 
ribosomes encounter a premature stop codon, Upfl interacts with 



eRFl and eRF3 to terminate translation [5,8]. Upf2 and Upf3, 
which are involved in selective recruitment of nonsense transcripts 
into the NMD pathway, form a separate sub-complex [9] . Each of 
the two sub-complexes can interact with nonsense transcripts 
independendy but both must be present for the surveillance 
complex to form and trigger NMD. 

In yeast, the predominant mode of NMD-mediated transcript 
degradation occurs when Upfl interacts with the decapping 
complex [10]. mRNA remodeling is believed to expose the cap, 
which triggers cap removal without the usual requirement for 
shortening of the poly(A) tail. Upfl associates with ribosomes and 
plays a role in translation termination at PTCs as a prelude to 
remodeling, de-capping, and decay [11]. The mRNA is then 
degraded from the 5' end by the exonuclease Xrnl. In a slower 
alternative pathway for NMD-mediated degradation, Upfl 
interacts with Ski7, a component of cytoplasmic exosomes [12]. 
Following enhanced degradation of the poly(A) tail to 7-20 A 
residues, cytoplasmic exosomes degrade the body of the mRNA 
from the 3' end [12]. 

In addition to RNA surveillance, NMD also affects the 
accumulation of transcripts produced from 458 endogenous genes 
in S. cerevisiae containing ORFs that are uninterrupted by an 
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in-frame PTC [13]. 220 of these mRNAs are direct targets of 
NMD where the changes in accumulation are caused by a change 
in the decay rate. The remaining transcripts that show NMD- 
dependent changes in accumulation are affected indirectly and 
show no change in decay rate. 

Three mechanisms have been described that trigger nonsense- 
mediated decay of direct targets. Two of these bring out-of-frame 
stop codons into register where they are recognized as PTCs that 
trigger NMD. Translation of upstream open reading frames 
(uORF) can trigger NMD when uORF termination occurs either 
in the 5' leader or at an out-of-frame stop codon within the 
primary ORF [13,14]. Alternatively, if translation initiation is 
inefficient at the normal start codon, typically the first AUG, 
ribosomes bypass the first AUG and continue scanning to the next 
AUG. If the second AUG is out-of-frame, ribosomes initiate 
translation in an alternative reading frame and terminate 
translation at an out-of-frame premature stop codon [13,15]. In 
the third mechanism, some transcripts are targeted for NMD by 
the 3 '-untranslated region (3'-UTR) [16,17]. Although the details 
are poorly understood, it has been shown that transcripts with 
unusually long 3'-UTRs are prone to NMD, which might cause 
the normal stop codon to be recognized as a PTC [16,17]. 

In this study we focused on the role of the 3'-UTR in NMD. We 
developed a reporter system to screen for 3'-UTRs that are 
required to trigger NMD. 3'-UTRs from two genes known to be 
targets of NMD were identified. In both cases, the genes produce 
heterogeneous transcripts that differ in the locations of the 3' ends. 
Our results show that multiple transcripts containing the same 
ORF can be differentially susceptible to NMD. The length of the 
3'-UTR appears to be the determining factor. While alternative 
3'-UTR splicing in mammals has been shown to lead to variations 
in NMD sensitivity, to our knowledge this is the first known 
example of 3'-UTR variability altering NMD sensitivity in 
budding yeast suggesting this mechanism may be used across the 
domain Eukaryota [18]. 

Materials and Methods 

Yeast strains 

Experiments were performed using strains W303A (MATa 
ura3-l his3- 11,15 leu2-3,112 trpl-1 ade2-l canl-100 upfl- 
?2:URA3); AAY320 (MATa leu2-3,112 trpl-1 canl-100 ura3-l 
ade2-l his3-ll,15), and BY4741 (MATa his3zfl, leu2 met 15 
ura3), the parent strain used to create the yeast knockout 
collection. Strains W303A and AAY320 was the parental control 
in experiments involving XRN1 and SKI7 knockouts. In some 
experiments, NMD-dependent changes in mRNA accumulation 
and decay were measured in derivatives of strain W303A because 
it was shown that the magnitudes of NMD-dependent changes are 
greatest in this strain [19]. 

5' and 3' mRNA decay pathways were analyzed using Nmd + 
and Nmd~ strains carrying null alleles of XRN1 and SKI7 as 
follows: BZY18 {MATa ura3-l his3-ll,15 leu2-3,112 trpl-1 ade2-l 
canl-100 upfl-A2:URA3 ski7:ADE2); BZY19 {MATa ura3-l his3- 
11,15 ku2-3,112 trpl-1 ade2-l canl-100 upJl-A2:URA3 xml:ADE2); 
BZY20 {MATa leu2-3,112 trpl-1 canl-100 ura3-l ade2-l his3-ll,15 
ski7:ADE2); and BZY21 (MATa leu2-3,112 trpl-1 canl-100 ura3-l 
ade2-l hu3-ll,15 xrnl:ADE2). 

Criteria for selecting candidate genes for the 3'-UTR 
screen 

To identify genes whose transcripts are most likely targeted for 
NMD through the 3'-UTR, we analyzed time-course microarray 
data describing mRNA half-lives that were used previously to 



identify and classify NMD targets [13]. Out of 458 NMD-sensitive 
mRNAs, 238 were excluded as candidates for 3'-UTR targeting 
because they were indirect targets, exhibiting altered abundance 
but unchanged half-lives in the absence of NMD. Among the 220 
direct targets, ~75 were predicted to be targeted for NMD due to 
a translatable uORF and ~50 due to bypass of the first AUG. 
Both invoke out-of-frame stop codons to terminate translation 
prematurely, and some transcripts might be targeted by both 
mechanisms. Among the direct targets of NMD, — 100 candidates 
mRNAs remained that could be targeted by an alternative 
mechanism, possibly one that involves the 3'-UTR. We setded on 
a relatively small number of candidates because of time constraints 
in constructing reporters to test the 3'-UTRs for a role in NMD. 
Eight candidates {MSH4, SP016, HMOl, PET18, TMR114Q 
CJW1, STE2, and DATS), were selected for analysis. The 3'-UTRs 
from PGA1, which is known to be targeted for NMD through the 
3'-UTR, and ACT1, which is known to be insensitive to NMD, 
were included as positive and negative controls, respectively [16]. 

For each candidate, an estimate was required for the length of 
the 3'-UTR before constructing reporters. Unlike mammals, S. 
cerevisiae genes lack a conserved transcription termination and 
polyadenylation signal sequence. We examined tiling microarray 
array data to estimate lengths of the 3'-UTRs [20]. Centromeric 
reporter plasmids were used to screen the effects of the 3'-UTRs 
on mRNA accumulation. The reporters express a hybrid gene 
inserted in plasmid pRS313 consisting of the CUP1 transcriptional 
promoter (copper metallothionine) fused to the ORF coding for 
green fluorescent protein (GFP) followed by a 3'-UTR of choice. 

3'-UTRs from candidate genes were inserted into the CUPT 
GFP reporter between EcoKl and Sad sites such that the GFP stop 
codon and downstream sequences were displaced by an insert 
containing the stop codon and 3'-UTR of the candidate gene. The 
plasmids containing 3'-UTRs were pJZ2 (753-nt 3'-UTR from 
PGA1); pJZ3 (345-nt from HMOl); pJZ4 (86-nt from PET18); pJZ5 
(63-nt from TMR114Q; pJZ6 (290-nt from ACT1); pJZ7 (133-nt 
from SP016); pJZ9 (77-nt from CAW); pJZIO (227-nt from 
MSH4); pJZll (272-nt from STE2); and pJZ12 (1520-nt from 
DATS). 

Translational read-through assay 

To analyze translational read-through of MSH4 transcripts, 
plasmids expressing the CUP1 -GFP-MSH4-3' -UTR reporter were 
constructed that contain a FLAG epitope tag inserted following 
the TAA stop codon (pJZ13) and FLAG combined with a 
mutation that changes the TAA stop codon to a TTA leucine 
codon (pJZ14). pJZ15 and pJZ16 are multi-copy versions of pJZ13 
and pJZ14, respectively, that carry the 2 \l origin of replication. 

RNA methods 

RNA was isolated from saturated overnight cultures of cells 
diluted to OD 600 = 0. 1 and to grown to midlog phase, OD 60 o — 
0.4—0.5. During outgrowth the CUP1 promoter was induced with 
100 uM CuS0 4 for 1.5-3 h. To assay mRNA accumulation, 
20 ml of cells were centrifuged for 3 min at 3200G after which the 
cells were resuspended in 0.7 ml RNA lysis buffer (10 mM Tris-Cl 
pH 7.4, 10 mM EDTA, 0.5% SDS) and flash frozen in a dry ice 
ethanol bath. RNA was prepared by adding 800 |xl of acid 
saturated phenol pH 4.3 (Fisher) and 150 ul of RNA lysis buffer to 
frozen cell pellets. To assay mRNA decay, transcription was 
inhibited prior to RNA isolation by the addition of 25 ug/ml 
thiolutin (a gift from Pfizer, part number CP-4092) [13]. RNA was 
pelleted and flash frozen at indicated time intervals. RNA 
extracted at time intervals following the addition of thiolutin was 
prepared by adding 750 ul of acid saturated phenol and 600 JLXl of 
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RNA lysis buffer to frozen cell pellets. RNA samples used to assay 
mRNA accumulation and decay were incubated at 65 °C for 
45 min with vortexing every 10—15 min. Samples were centri- 
fuged at 4°C in a microcentrifuge at maximum speed for 15 min 
to pellet debris. The aqueous phase was subjected to a second acid 
phenol extraction. Residual phenol was removed from the 
aqueous phase by vortexing with 750 JLXl chloroform for 2 min 
and centrifuging for 2 min. The RNA was precipitated by adding 
NaOAc pH 5.3 to a final concentration of 0.3 M, 2.5 volumes of 
100% ethanol and pelleted by centrifugation for 30 min at 4°C. 
Pellets were washed with 70% ice cold ethanol, dried, and 
resuspended in TE buffer. 

RNA was fractionated on a 1 % agarose gel and transferred to 
GeneScreen Plus (PerkinElmer Life Sciences). Transcripts were 
analyzed by northern blotting using sequence-specific probes 
labeled with 32 P (PerkinElmer BLU008H250UC and 
BLU007Z001MC) and prepared by in vitro transcription. The 
non-coding RNA SCR1 was used as a loading control. Northern 
blots were exposed to phosphorimager screens (Molecular 
Dynamics) scanned by a Typhoon 9200 (GE Healthcare, Life 
Sciences) and analyzed using Image Quant 5.2. All reported values 
are the result of at least three replicates. 

MSH4 transcripts were analyzed by RNase H treatment. 
Samples were prepared by combining 25 u.g RNA, 3.11 |J,g 
poly(T) oligo (T 20 by IDT) and 10 ug internal GFP cleavage 
probe (5' ACA GGG CCA TCG CCA ATT GGA GTA TT) in a 
final volume of 25 ul. Samples were heated to 65°C for 10 min 
and allowed to cool to room temperature for 10 min after which 
3 ul 10 x reaction buffer and 2 ul RNase H (New England 
Biochemicals) were added. Samples were incubated at 37°C for 
30 min at which point reactions were quenched by placing the 
samples on ice and adding an equal volume of urea loading buffer 
(8 M Urea, 2 mil Tris, pH 7.5, 20 mM EDTA). Samples were 
fractionated on a 6% denaturing polyacrylamide gel, transferred 
to GeneScreen Plus and analyzed by northern blotting as 
described above. 

RT-PCR-Seq 

40 ug RNA and 6 ug poly T-oligo were combined and brought 
up to a volume of 3 1 ul with TE. The reaction was incubated at 
70°C for 10 min and chilled on ice for 1-2 min. A 60 ul reaction 
was carried out with Superscript III (Invitrogen) with a final 
concentration of 1 x reaction buffer 10 mM DTT, 50 uM each 
nucleotide and 400 U Supercript III RT. Reverse transcription 
was allowed to proceed for 2 h at 50°C. RNA was hydrolyzed by 
adding NaOH to a final concentration of 200 mM, EDTA to 
50 mM and incubating at 65°C for 15 min. Reactions were 
neutralized by adding HEPES pH 6.5 to a final concentration of 
333 uM. The fragments of interest were amplified using standard 
PGR methods. PCR fragments were gel-purified and re-amplified 
using a poly(T) oligo and a nested primer to remove artifacts. 
Products were sequenced at the University of Wisconsin Madison 
Biotechnology Center Sequencing Facility. 

Protein isolation and western blotting 

Saturated overnight cultures of cells were diluted to OD 6M = 
0.1 and allowed to grow to midlog phase, OD 6M = 0.4—0.5. 
During outgrowth, cells were induced with 100 ul CuS0 4 , washed 
with distilled water, and centrifuged. The supernatant was poured 
off and the pellet was flash frozen. Cell pellets were lysed by 
vortexing with glass beads and lysis buffer (50 mM Tris 7.4, 
150 mM NaCl, 1 mM EDTA, 0.1% Tween 20, protease inhibitor 
cocktail (Sigma P8215), 1.742 mg/ml PMSF and 1.566 mg/ml 
benzamidine). Protein concentration of the clarified lysate was 



calculated using the Bradford method with a BSA standard. 
Samples were fractionated using a SDS-PAGE gel with a 12.5% 
resolving layer. Protein was transferred to Amersham Hybond 
ECL 0.45 fxM (GE Healthcare). After blocking for 1 h with 1:1 
PBS:Odyssey blocking buffer (LI-COR biosciences). Membranes 
were probed overnight with 1:1 PBS:Odyssey blocking buffer and 
0.1% Tween-20 with both rabbit anti-GFP (Sigma-Aldrich 
G1544) diluted 1:5,000 and mouse anti-FLAG M2 (Sigma 
F1804) diluted 1:3,000. After washing in lxPBS and 0.1% 
Tween-20 membranes were probed with both goat anti-rabbit 
IRDye 680LT (LI-COR 926-68021) and goat anti-mouse IRDye 
800CW (LI-COR 926-32210) antibodies at 1:25,000 dilution in 
1:1 PBS:Odyssey blocking buffer with 0.1% Tween-20 and 0.01% 
SDS. Membranes were then washed, allowed to dry and scanned 
using the Odyssey Imaging system and data were analyzed using 
the corresponding Image Studio software v2.1.10 (LI-COR). 

Results 

Identification of 3'-UTRs that function in NMD 

To find 3'-UTRs that play a role in NMD, a reporter system 
was designed. The reporter mRNA is transcribed from a gene 
fusion consisting of the CUP1 (copper metallothionine) promoter, 
the ORF coding for green fluorescent protein (GFP), and a ?>'- 
UTR of choice (Fig. 1A). The reporter includes the stop codon 
from the 3' -UTR to be tested rather than the GFP stop codon so as 
preserve as much as possible the normal context for translation 
termination. The CUP1 promoter and the GFP coding region are 
insensitive to NMD, enhancing the likelihood that NMD- 
dependent changes in mRNA accumulation and decay will be 
functionally linked to the 3' -UTR. Detection of NMD targets, 
normally present in low abundance, is enhanced by CUP1 -driven 
expression to high levels in the presence of exogenous copper. To 
our knowledge, there are no reported examples where over- 
expression of any gene limits NMD. 

After considering genomic data to select which 3'-UTRs to test 
(Materials and Methods), we settled on eight: MSH4, SP016, 
HMOl, PET18, TMR114C, CNN1, STE2, and DAL5. All of the 
candidates were previously shown to be sensitive to NMD during 
vegetative growth, including MSH4 and SP016, which are known 
genetically to function in meiosis and sporulation [21,22,23]. 

Northern blotting was used to reveal NMD-dependent changes 
in reporter mRNA accumulation. Two of the eight 3'-UTRs 
(MSH4 and SP016] produced an altered pattern of transcripts 
when wild-type and upflA strains were compared (Fig. IB). RNA 
species accumulated in the Nmd~ strains expressing the MSH4 
and SP016 reporters (Fig. IB, arrows). As shown in subsequent 
experiments, the longer RNAs are transcriptional read-through 
products that are sensitive to NMD. In the absence of NMD, the 
NMD-sensitive read-through transcript for MSH4 increased from 
12.3% of total transcript to 36.6%, a three-fold change. For 
SPOl 6, the NMD-sensitive read-through transcript increased from 
4.6% to 17.8%, a four-fold change (Fig 1C). 

The MSH4 3'-UTR plays a role in NMD 

To ensure that changes in accumulation were related specifi- 
cally to NMD, the experiments for the MSH4 reporter were 
repeated in strains carrying a^/?zlderived from strain BY4741. If 
the inactivation of UPF1 and UPF2 have similar effects, then the 
effects are most likely due to the loss of NMD. Both the upflA and 
upJ2A strains showed nearly identical increases in accumulation of 
the long MSH4 transcript (Fig. 2A). As expected, the magnitudes of 
NMD-dependent differences in accumulation were lower in the 
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Figure 1. The 3'UTR of MSH4and SP016 produce transcription read-through products in an NMD dependent manner. (A) A reporter 
system was constructed on a centromeric plasmid. The reporter consisted of the 3 ' UTR of genes of interest fused to GFP under control of the copper 
inducible CUP1 promoter. (B) Eight genes were examined by northern blot in a BY4741 background with both a wild type and Aupfl strain. The 
3'UTR of PGA1 is known to target transcripts for NMD and ACT1 is known to be insensitive to NMD. Both were included as controls [16]. Only the 
MSH4 3'UTR and SP016 3'UTR displayed accumulation of a longer transcript, which are indicated by arrows. (C) The relative accumulation of the 
longer transcript was measured in a W303A background. Relative accumulation was calculated as the percentage of the longer transcript relative to 
total transcripts. Strain dependent differences in the magnitude of NMD-dependent accumulation in BY4741 and W303A background is consistent 
with previously published results [19]. All values are the result of averaging a minimum of 3 replicates. 
doi:1 0.1 371 /journal.pone.0080981 .g001 



BY4741 strains (Fig. 2) compared to W303A (Fig. 1) (two-fold 
versus three-fold, respectively) [19]. 

The 3' end of the long transcript was mapped using RT-PCR- 
Seq (Materials and Methods). The RNA ends in a cluster of stops 
starting 1 1 70-nt downstream of the GFP stop codon. It extends 
past the end of the MSH4 3'-UTR at 224 nucleotides into plasmid 
DNA, where it ends in the 3 '-proximal part of the HIS3 coding 
region. The HIS3 gene is located on the plasmid downstream of 
CUP1 -GFP-MSH4-3' - UTR reporter and is transcribed in the 
opposite direction. Our results define the long RNA as a 
transcriptional read-through product. 



The CUP1-GFP-MSH4-3'-UTR reporter produces multiple 
transcripts that are differentially susceptible to NMD 

Previous genomic data indicated that the endogenous MSH4 
gene is expressed during vegetative growth and produces a 
transcript that is sensitive to NMD [13]. Before proceeding 
further, northern blotting was used to assess the levels of 
expression and to verify that MSH4 mRNA is both expressed 
and sensitive to NMD in vegetative cells. A 1.6 ± 0.3 fold increase 
(p = 0.02) in endogenous MSH4 mRNA accumulation was 
observed in a upflA strain compared to a wild-type strain 
(Fig. 2B), indicating that MSH4 is a likely target of NMD. This 
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Figure 2. The long transcriptional read-through product from 
GFP-MSH4-3' UTR accumulates in an NMD-dependent manner. 

(A) The accumulation of the long GFP-MSH4 3'UTR construct was 
examined in a BY4741 background. The longer transcript accumulates 
to the same level in both a Aupfl and Aupf2 strain verifying the 
observed effect is NMD dependent. (B) The endogenous MSH4 gene 
shows an increase of steady state mRNA in an NMD deficient strain, 
confirming it as a target of NMD. 
doi:1 0.1 371 /journal.pone.0080981 .g002 

is consistent with previous NMD target studies, which define a 
target of NMD as an mRNA which shows a 1 .5 fold or greater 
increase in mRNA accumulation when NMD is inhibited [13]. 

The expression levels of the endogenous gene were very low 
compared to the reporter system. In the northern blot in Fig. 2B, 
only one band was detected corresponding to transcripts from the 
endogenous gene. No equivalent to the long transcript produced 
from the reporter was detected, but that was expected since the 
long transcript from the reporter extends past the end of the 3'- 
UTR and into the plasmid sequences. If a long transcript of the 
endogenous gene is present, it accumulates at a level below 
detection. We opted to use the copper-inducible reporter system in 
further studies to over-express the transcription products and more 
conveniendy assess the role of the 3' -UTR in targeting, but we 
recognize that transcripts produced from the reporter could differ 
in length compared to endogenous transcripts. 

Using the reporter system, we measured the half-lives of the 
long and short transcripts by using thiolutin to inhibit transcription 
followed by a time course northern blot to examine the rate of 
disappearance of RNA present at the time of transcription 
inhibition. By plotting relative RNA remaining versus time, the 
RNA half-life can be determined (Fig. 3). The long transcript had 
a two-fold increase in half-life when NMD was inactivated (5.19 ± 
0.69 min in wild-type versus 11.89 ± 2.1 min in a upfl A strain) 
(Fig. 3A). The difference in half-life was similar to the change in 
accumulation at t = 0. This result confirms that the long transcript 
is a target of NMD. When the short transcript was examined, no 
differences in accumulation at t = 0 or half-life were observed 
(Fig. 3B). The half-lives were 5.47 ± 0.29 min in wild-type versus 
6.28 ± 0.66 min in a upflA strain), suggesting that the short 
transcript is not a target of NMD. 

The short transcript is a composite of five NMD- 
insensitive RNAs 

Heterogeneity was revealed in the short transcript detected as a 
single band on a 1 % agarose gel. The source of the heterogeneity 
was assessed by combining an RNase H cleavage assay with 
fractionation on a 6% polyacrylamide gel (Materials and 
Methods). By using a DNA oligonucleotide that hybridizes to 
the 3' end of GFP and a poly(T) oligonucleotide that hybridizes to 
the poly(A) tail, truncated versions of the transcripts were 
generated by RNase H cleavage of DNA:RNA duplexes to 
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produce RNAs that have the same 5' end and lack a poly(A) tail. 
Any size differences reflect differences in length at the 3' end. 

Using this assay, five transcripts, designated SI through S5 
(Fig. 4), were detected in Nmd and Nmd strains expressing the 
MSH4 reporter (Fig 4A). The resolution of bands was sufficient to 
separately measure changes in accumulation when NMD is 
inactivated. None of the five RNAs showed any NMD-dependent 
difference in accumulation (Fig. 4B), where accumulation was 
measured as a percentage of total short transcripts (all five RNAs 
combined). The results show that the half-life of the short 
transcript detected on agarose gels, which was unchanged when 
NMD was inactivated (Fig. 3B), is an aggregate half-life of five 
RNAs with different 3' ends. Since none of the five RNAs show a 
NMD-dependent change in abundance, we conclude that none of 
the short RNA family members are targets of NMD. 

The 3' end of the shortest transcript was mapped using RT- 
PCR-Seq (Materials and Methods). The shortest RNA has a 3' 
end 123 nucleotides from the translational stop codon in the 
MSH4 reporter. The 3' ends of the other transcripts were 
estimated by comparing relative migration on gels to the migration 
of known size markers (Fig. 4C). Transcripts SI, S2, and S3 end 
within the 224 nt-long MSH4 3'-UTR (Fig. 4D). Transcripts S4 
and S5 extend into downstream plasmid DNA by 28 and 55 
nucleotides respectively. As described earlier, the long NMD- 
sensitive transcript (LI) detected on agarose gels ends at nt 1170 
(Fig. 4C). It extends 943-nt into plasmid DNA and ends in the 3' 
proximal part of the HIS3 ORF. 

NMD recruits the L1 transcript into the 5' mRNA decay 
pathway 

In S. cerevisiae, transcripts are degraded by two pathways called 
5' and 3' decay. In 5' decay, the 5'-cap is removed and RNA is 
degraded from the 5' end by Xrnl [24,25]. The prevailing view is 
that the RNA helicase Upfl re-models the RNA exposing the cap 
while simultaneously interacting with the de-capping enzyme. 
Consistent with this mechanism, NMD promotes de-capping and 
5' decay regardless of poly(A) tail length [25]. In 3' decay, the 
RNA is degraded from the 3' end by the exosome. Upfl interacts 
with Ski7, a component of the exosome [26]. Through the 
interactions of Upfl with components of both decay pathways, 
NMD can activate either pathway depending on the target. 

To determine which pathway predominates when the reporter 
transcript LI is degraded by NMD, we used mutants to inactivate 
NMD and either the 5' or the 3' decay pathway. Null xrnl 
inactivates 5' decay. Null ski7 inactivates 3' decay. Null upfl 
inactivates NMD. We analyzed the effects of blocking a decay 
pathway by northern blotting of RNA from strains carrying each 
of these single mutants and xrnl A upfl A and ski7A upfl A double 
mutants (Fig. 5). 

The LI transcript accumulated to a similar level in both the 
Axml and Aupfl mutants {p-Q.l&j. The Axrnl Aupfl and Aski7 
Aupfl double mutants also had accumulation levels for LI that 
were statistically indistinguishable from the Aupfl mutant (p = 0.82 
and p = 0.87, respectively). However, the Aski7 mutant showed a 
statistically significant difference in accumulation of LI compared 
to the Aupfl strain (p = 0.001). We conclude that when NMD 
targets the LI transcript for rapid decay, the 5' decay pathway is 
activated predominandy over the 3' decay pathway. 

Translation of the reporter transcripts 

Premature translational stop codons play a role when NMD 
targets a transcript for rapid decay. When transcripts are targeted 
through translation of an upstream ORF or inefficient translation 
initiation, PTCs brought into frame trigger NMD. We tested the 



PLOS ONE | www.plosone.org 



5 



November 2013 | Volume 8 | Issue 11 | e80981 



Role of 3-UTR's in NMD 



A) 1 

0.9 



Time(min) Half Life 

0 5 10 20 30 40 (mln) 




< 0.5 

« 04 

| 0.3 
to 

| 0.2 
0.1 



. WT 

•Aupfl 



0 5 10 15 20 25 30 35 40 45 
Time (min) 



Time(min) Half Life 

0 5 10 20 30 40 (mm) 




20 30 
Time (min) 



C) 



Time (min) 
WT 

Aupfl 




Figure 3. The long transcript from GFP-MSH4-3'UTR shows increased stability in NMD-deficient strains. Representative half-life 
measurements in a W303A background are shown for both (A) the longer transcript product and (B) the shorter transcript product. Inlay depicts the 
sample being represented in the graph and is adjacent to the corresponding marker. Only the longer transcript shows increased stability in an NMD 
deficient strain when transcription is inhibited by the addition of thiolutin. 
doi:1 0.1 371 /journal.pone.0080981 .g003 



model that long 3'-UTRs target transcripts for NMD because the 
normal stop codon is recognized as a PTC. Translation 
termination at a PTC has been shown to be less efficient in the 
absence of NMD as evidenced by the accumulation of transla- 
tional read- through products [27]. If the normal stop codon is seen 
as premature, then proteins that have C-terminal extensions 
should accumulate. 

To test the model, we developed an assay to detect proteins that 
end at the normal stop or that extend beyond the normal stop 
codon. A FLAG tag was inserted after the normal stop codon. As a 
positive control, the normal stop codon was mutated to a sense 
codon, Leu (Tig 6A). The FLAG tag was positioned three codons 
downstream of the normal stop codon and two codons upstream of 
the next stop codon to minimize interference with translation 
termination. To ensure that the FLAG sequence had no effect by 
itself on NMD targeting, we used northern blotting to measure 



accumulation of RNA produced from the mutant reporters 
(Fig. 6B). Neither the insertion of the FLAG sequence nor the 
mutation changing the stop codon to a sense codon altered the 
relative accumulation of the LI transcript in wild type and Nmd 
strains. 

Western blotting was used to assay for read-through past the 
normal stop codon. The expectation was that inactivation of 
NMD by upflA might result in increased read-through. By using 
secondary antibodies that were fluorescently labeled with two 
different fluorophores, it was possible to analyze a single blot 
simultaneously for the presence of both GFP and FLAG (Fig. 1C). 
Four strains were assayed: two Nmd + strains that have the normal 
stop codon or a sense codon in its place, and a similar set of two 
Nmd strains. GFP was expressed in all of the strains. The 
immediate downstream stop codon after the FLAG tag is an 
efficient terminator since only one band was detected when 
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Figure 4. Five short transcripts are not targets of NMD. (A) After treatment with RNase H the shortest transcript observed on a 1% agarose gel 
can be resolved into 5 distinct bands on a 6% polyacrylamide gel. (B) Relative accumulation of the five short transcripts. Values in WT and Aupfl 
columns represent each transcript as a percentage of the sum of the first five transcripts. (C) Sizes of all observed transcripts from northern blots 
performed on polyacrylamide gels (transcripts 1-5) and agarose gels (transcript B). Sizes were determined based on a combination of RT-PCR and 
estimates based on relative migration in gels to known size markers. (D) Schematic representing the transcription products of the GFP-MSH4 3'UTR 
construct (not to scale). The plasmid DNA fragment is 710 bases and contains prokaryotic DNA. The HIS3 gene is located on the crick strand relative to 
the GFP coding region. 
doi:1 0.1 371 /journal.pone.0080981 .g004 



examining FLAG or GFP. FLAG-tagged protein was detected 
when the normal stop codon was mutated to a sense codon. In 
Nmd- strains, no FLAG-tagged protein that extends past the 
normal site for translation termination was detected. 

To increase the sensitivity of the experiment, we switched from 
a single-copy centromeric plasmid to a multi-copy 2 |X plasmid 
that over-expresses the reporter RNAs on a per cell basis. Using 
the centromeric plasmid system, we can detect 8-10% read- 
through. Using the 2 u, plasmid, sensitivity improves to about 1 % 
read-through. These read-through values for theoretical limits of 
detection were calculated by comparing the signal for sense codon 
construct, defined as 100% read- through, with background, 



WT Aupfl Axrnl Aski7 



Axrnl Aski7 
Aupfl Aupfl 



Accumulation 12.3 36.6 
Error 5.1 13.4 




30.3 
1.7 



Figure 5. GFP-MSH4-3' UTR transcripts are degraded from the 5' 
ends. The steady state mRNA accumulation levels were measured in a 
W303A background in Axrnl, Aski7, Axrnl Aupfl and Aski7 Aupfl 
strains. Only the Aski7 mutant showed a statistically significant 
difference in accumulation from a Aupfl mutant. (Fig 1C) indicating 
that Upfl and Xrn1 are both part of the same degradation pathway for 
3'UTR-mediated NMD. 
doi:1 0.1 371 /journal.pone.0080981 .g005 



defined as 0% read-through. Furthermore, the 1% threshold for 
2 |X plasmid expression is based on translation of all reporter 
transcripts. Since Nmd-sensitive LI comprises about one third of 
total reporter transcripts in Nmd - strains, read-through would 
have to occur at about 3% or more in order to be detected. No 
read-through proteins were detected (data not shown). 

In all of these experiments the accumulation levels observed for 
tagged proteins suggest that any read-through products that might 
arise should have the same stability as the normally terminated 
protein. It is therefore unlikely that read-through proteins are 
made and then degraded. Although the normal stop codon is 
considered the best candidate to trigger NMD, the associated 
translational read-through typical of PTCs in Nmd strains either 
does not occur with the normal stop codon or read-through is 
below the 3% threshold. 

Discussion 

To identify transcripts targeted for NMD through the 3' -UTR, 
we fused the 3'-UTRs from candidate genes with green fluorescent 
protein (GFP) expressed from a copper-inducible CUP1 promoter 
on a centromeric plasmid. Northern blotting was used to monitor 
transcript length and abundance in Nmd + and Nmd - strains. The 
most likely candidates for 3'-UTR-mediated targeting were 
gleaned from genomic data identifying direct targets of NMD 
(Guan et al., 2006). After subtracting those known or suspected of 
being targeted without 3' -UTR involvement and by considering 
the predicted length of the 3'-UTRs, we settled on a short list of 
eight candidate genes. Two of the 3'-UTRs targeted a reporter 
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Figure 6. 3'UTR targeting is not associated with extensive translational read-through. (A) A FLAG tag was inserted three codons 
downstream of the normal stop codon of the GFP-MSH4 3'UTR construct to monitor translational read-through. The normal TAA (stop) codon was 
also mutated to the sense codon TTA (Leu) as a positive control. The * indicates a stop codon. (B) Northern blots were performed to measure relative 
accumulation of the NMD sensitive transcript in a wild type and NMD deficient strain. The presence of the FLAG sequence or stop^sense mutation 
did not affect relative accumulation of the mRNA. (C) Western blot of the GFP-FLAG-MSH4 3'UTR fusion. No translational read-through product was 
detected in the wild type or NMD deficient strain. 
doi:10.1371/journal.pone.0080981.g006 



transcript for NMD (MSH4 and SP016). Both reporters produced 
long NMD-sensitive read-through transcripts. The remaining six 
3'-UTRs failed to target the reporter transcripts for NMD possibly 
due to false discoveries and misclassifications of direct and indirect 
targets of NMD inherent in the genomic data, erroneous 
predictions of the targeting mechanism for a given transcript, or 
erroneous predictions of 3'-UTR length leading to reporters with 
incomplete 3'-UTRs. The detection of both NMD-sensitive and 
NMD-insensitive MSH4 3'-UTR transcripts provides unique 
internal controls permitting examination of stop codon context, 
3'-UTR sequence requirements, and 3'-UTR length requirements 
for NMD targeting. 

Since yeast lack a defined polyadenylation regulatory sequence 
and the reporter construct produces transcripts that extend well 
past the MSH4 3'-UTR sequence it is possible that we could be 
observing an effect related to transcription termination and post- 
transcriptional processing. We find this possibility highly unlikely 
since yeast mutants with defects in transcription termination and 
processing also show an inability to export the improperly 
processed RNA from the nucleus [28]. NMD occurs in the 
cytoplasm during translation so any defect in transcription 
termination or processing would be unable to target a defective 
transcript for NMD. 

In depth studies of the GFP-MSH4 3' UTR reporter showed 
that only the longest transcript is a target of NMD while five 
shorter transcripts are insensitive to NMD. To our knowledge this 
is the first known example of a gene producing multiple transcripts 
in which only a subset are targets of NMD in S. cerevisiae. All other 
targeting mechanisms involve recognition of a PTC by compo- 
nents of the surveillance complex, including Upf proteins and 
translation termination factors that mediate the steps in NMD 
leading to rapid RNA decay. For targeting by a 3'- UTR, the most 
logical stop codon recognized by the surveillance complex is the 
normal stop codon by virtue of its proximity to the 3' -UTR. If the 
context surrounding the normal stop codon had an effect on 3'- 
UTR-mediated targeting, we should have observed changes in 
accumulation and half-life for all of the transcripts produced from 
the GFP-MSH4-3' UTR reporter in Nmd~ strains. Instead, we 



found that the long LI transcript was sensitive to NMD based 
increased accumulation and decreased decay rate in Nmd 
strains, but the five shorter transcripts, SI through S5, were 
insensitive to NMD. In the aggregate, there was no change in 
abundance or half-life of the short transcripts in Nmd strains. 
Since the short and long transcripts share a stop codon with 
identical context we can confidently state that the context of the 
stop codon has no effect on NMD targeting via the 3' -UTR. 

3' end mapping showed that the SI, S2, and S3 transcripts 
terminated within the MSH4-3' -UTR, whereas S4, S5, and LI 
extended beyond the 3'-UTR into adjacent plasmid sequences. 
Since LI is NMD-sensitive but S4 and S5 are insensitive, plasmid 
sequences in the transcripts up to the S5 3' end are not per se 
responsible for NMD-targeting. LI terminates near the 3' end of 
the HIS3 ORF on the plasmid. Since HIS3 is present in opposite 
orientation to the reporter gene, it is possible but unlikely that the 
portions of the HIS3 gene included in the LI transcript contain 
signals for NMD targeting. The most likely model is that targeting 
is dependent on the length of the 3' -UTR. Since poly(A) binding 
protein plays a role in translation termination at the normal stop 
codon [29], long 3'-UTRs might disrupt the termination process 
due to the extended distance between the stop codon and poly(A) 
binding protein bound to the poly(A) tail. Put another way, the 
normal stop codon might be recognized as a premature stop codon 
by the surveillance complex when the 3' -UTR exceeds a 
maximum length required for efficient translation termination. 

Endogenous MSH4 is a target of NMD (Guan et al., 2006), but 
the endogenous gene cannot express a transcript equivalent to the 
NMD-sensitive LI transcript from the reporter, which includes 
plasmid sequences. However, RNA-seq experiments show that 
endogenous MSH4 produces multiple transcripts with different 3' 
ends [30,31]. We confirmed that MSH4 is a target of NMD by 
northern blotting, but could not resolve potential multiple 
transcripts on agarose gels even when the endogenous MSH4 is 
placed under the inducible CUP1 promoter and overexpressed 
(data not shown). We estimate transcripts that differ in length by 
up to 500 nt would migrate as a single, unresolved band. 
Resolution by PAGE was deemed unreliable due to the extremely 
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low abundance of MSH4 transcripts in vegetative cells. We have 
therefore not been able to confirm that endogenous MSH4 is 
targeted for NMD by a long 3'-UTR. 

Although we did not perform a detailed analysis of the role of 
the SP016 3' -UTR in NMD, we believe that a phenomenon is 
occurring similar to that observed for MSH4. Detailed studies have 
shown that SPOl 6 produces four transcripts with 3'-UTRs ranging 
from 147 to 747 nt in length [32]. From our studies of the MSH4 
3' -UTR, transcript S5, which is 285 nt in length, did not trigger 
NMD. If length is the only important parameter for NMD 
targeting, some of the SPOl 6 transcripts are too short to trigger 
NMD while others could be long enough and, according to our 
data, at least one is NMD-sensitive. We have no reason to believe 
that the MSH4- and SPOl 6 3'-UTRs target reporter transcripts for 
NMD because the endogenous genes perform functions in 
sporulation. In vegetative cells, both genes are expressed albeit 
at low levels, and might perform vegetative functions. Studies 
based on our reporter system cannot address whether NMD 
impacts gene expression in vegetative cells, sporulating cells, or 
both. 

Transcripts with long 3'-UTRs from any gene could be 
generated by inefficient termination of transcription or aberrant 
processing by the cleavage/polyadenylation complex, suggesting 
the possibility that transcripts with long 3'-UTRs are targets of 
NMD because NMD is performing an RNA surveillance function 
ultimately related to the efficacy of transcription termination. The 
link that would make this model attractive is if long 3'-UTRs 
caused a disruption of translation termination at the normal stop 
codon. While other targeting mechanisms prevent the accumula- 
tion of truncated proteins, disruption of translation termination at 
the normal stop codon by a long 3' -UTR might lead to 
translational read-through and the accumulation of proteins that 
are longer than normal. NMD might degrade abnormally long 
and potentially deleterious transcripts as an RNA surveillance 
mechanism. 

To test this model, we designed a western blotting assay based 
on the inclusion of epitope tags in MSH4 3' -UTR reporters so that 
proteins that end at the normal stop codon could be detected and 
distinguished from abnormally long read-through proteins. 
According to our calculations, read-through would have to occur 
at a threshold of about 3% in NMD sensitive transcripts when 
expressed from a multi-copy 2 p. plasmid in order for the read- 
through proteins to be detected. The mixture of target and non- 
target transcripts prevented us from achieving higher sensitivity. 
No translational read-through was detected. 

For premature stop codons embedded in a dual reporter, the 
frequency of translational read-through past a PTC can be as high 
as 2-3% in Nmd~ strains [27]. Read-through levels were shown to 
depend on which stop codon is used, where it is located, and the 
context surrounding the stop codon. Since no MSH4 3' -UTR- 
mediated translational read-through was detected using our 
immunological assay, we conclude either that the proposed link 
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